Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 824 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 116.0 KiB |
| Average record size in memory | 144.2 B |
Variable types
| CAT | 10 |
|---|---|
| NUM | 7 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-06-24 05:59:22.655732 |
|---|---|
| Analysis finished | 2020-06-24 05:59:30.717229 |
| Duration | 8.06 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
| Distinct count | 824 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20221.21359223301 |
|---|---|
| Minimum | 37 |
| Maximum | 41164 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.4 KiB |
Quantile statistics
| Minimum | 37 |
|---|---|
| 5-th percentile | 2130.3 |
| Q1 | 11112.5 |
| median | 19165.5 |
| Q3 | 29411.25 |
| 95-th percentile | 38966 |
| Maximum | 41164 |
| Range | 41127 |
| Interquartile range (IQR) | 18298.75 |
Descriptive statistics
| Standard deviation | 11449.16203 |
|---|---|
| Coefficient of variation (CV) | 0.5661955932 |
| Kurtosis | -1.071785952 |
| Mean | 20221.21359 |
| Median Absolute Deviation (MAD) | 9083.5 |
| Skewness | 0.08129332794 |
| Sum | 16662280 |
| Variance | 131083311.1 |
| Value | Count | Frequency (%) | |
| 16166 | 1 | 0.1% | |
| 12979 | 1 | 0.1% | |
| 10061 | 1 | 0.1% | |
| 2763 | 1 | 0.1% | |
| 33481 | 1 | 0.1% | |
| 22355 | 1 | 0.1% | |
| 40543 | 1 | 0.1% | |
| 2755 | 1 | 0.1% | |
| 1766 | 1 | 0.1% | |
| 23228 | 1 | 0.1% | |
| Other values (814) | 814 | 98.8% |
| Value | Count | Frequency (%) | |
| 37 | 1 | 0.1% | |
| 75 | 1 | 0.1% | |
| 88 | 1 | 0.1% | |
| 164 | 1 | 0.1% | |
| 199 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 41164 | 1 | 0.1% | |
| 41123 | 1 | 0.1% | |
| 41121 | 1 | 0.1% | |
| 40970 | 1 | 0.1% | |
| 40880 | 1 | 0.1% |
age
Real number (ℝ≥0)
| Distinct count | 54 |
|---|---|
| Unique (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.60800970873787 |
|---|---|
| Minimum | 19 |
| Maximum | 92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.4 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 32 |
| median | 38 |
| Q3 | 46 |
| 95-th percentile | 57 |
| Maximum | 92 |
| Range | 73 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.29022275 |
|---|---|
| Coefficient of variation (CV) | 0.259801561 |
| Kurtosis | 0.7273298942 |
| Mean | 39.60800971 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.7771791336 |
| Sum | 32637 |
| Variance | 105.8886842 |
| Value | Count | Frequency (%) | |
| 32 | 47 | 5.7% | |
| 34 | 43 | 5.2% | |
| 31 | 39 | 4.7% | |
| 36 | 37 | 4.5% | |
| 33 | 34 | 4.1% | |
| 30 | 34 | 4.1% | |
| 43 | 33 | 4.0% | |
| 37 | 30 | 3.6% | |
| 29 | 30 | 3.6% | |
| 35 | 29 | 3.5% | |
| Other values (44) | 468 | 56.8% |
| Value | Count | Frequency (%) | |
| 19 | 2 | 0.2% | |
| 20 | 2 | 0.2% | |
| 21 | 1 | 0.1% | |
| 22 | 4 | 0.5% | |
| 23 | 7 | 0.8% |
| Value | Count | Frequency (%) | |
| 92 | 1 | 0.1% | |
| 78 | 1 | 0.1% | |
| 76 | 1 | 0.1% | |
| 74 | 2 | 0.2% | |
| 71 | 2 | 0.2% |
job
Categorical
| Distinct count | 12 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| admin. | |
|---|---|
| blue-collar | |
| technician | |
| services | |
| management | |
| Other values (7) |
| Value | Count | Frequency (%) | |
| admin. | 210 | 25.5% | |
| blue-collar | 207 | 25.1% | |
| technician | 126 | 15.3% | |
| services | 67 | 8.1% | |
| management | 51 | 6.2% | |
| self-employed | 36 | 4.4% | |
| retired | 35 | 4.2% | |
| entrepreneur | 32 | 3.9% | |
| unemployed | 18 | 2.2% | |
| housemaid | 18 | 2.2% | |
| Other values (2) | 24 | 2.9% |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.041262136 |
| Min length | 6 |
marital
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| married | |
|---|---|
| single | |
| divorced | 81 |
| unknown | 2 |
| Value | Count | Frequency (%) | |
| married | 492 | 59.7% | |
| single | 249 | 30.2% | |
| divorced | 81 | 9.8% | |
| unknown | 2 | 0.2% |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.796116505 |
| Min length | 6 |
education
Categorical
| Distinct count | 8 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| university.degree | |
|---|---|
| high.school | |
| basic.9y | |
| professional.course | |
| basic.4y | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| university.degree | 238 | 28.9% | |
| high.school | 189 | 22.9% | |
| basic.9y | 127 | 15.4% | |
| professional.course | 93 | 11.3% | |
| basic.4y | 83 | 10.1% | |
| basic.6y | 58 | 7.0% | |
| unknown | 35 | 4.2% | |
| illiterate | 1 | 0.1% |
Length
| Max length | 19 |
|---|---|
| Median length | 11 |
| Mean length | 12.48907767 |
| Min length | 7 |
default
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| no | |
|---|---|
| unknown |
| Value | Count | Frequency (%) | |
| no | 664 | 80.6% | |
| unknown | 160 | 19.4% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.970873786 |
| Min length | 2 |
housing
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| yes | |
|---|---|
| no | |
| unknown | 21 |
| Value | Count | Frequency (%) | |
| yes | 417 | 50.6% | |
| no | 386 | 46.8% | |
| unknown | 21 | 2.5% |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 2.633495146 |
| Min length | 2 |
loan
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| no | |
|---|---|
| yes | |
| unknown | 21 |
| Value | Count | Frequency (%) | |
| no | 659 | 80.0% | |
| yes | 144 | 17.5% | |
| unknown | 21 | 2.5% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.302184466 |
| Min length | 2 |
comm_type
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| cellular | |
|---|---|
| telephone |
| Value | Count | Frequency (%) | |
| cellular | 547 | 66.4% | |
| telephone | 277 | 33.6% |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.336165049 |
| Min length | 8 |
month
Categorical
| Distinct count | 10 |
|---|---|
| Unique (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| may | |
|---|---|
| jul | |
| aug | |
| jun | |
| nov | |
| Other values (5) |
| Value | Count | Frequency (%) | |
| may | 245 | 29.7% | |
| jul | 186 | 22.6% | |
| aug | 110 | 13.3% | |
| jun | 101 | 12.3% | |
| nov | 80 | 9.7% | |
| apr | 62 | 7.5% | |
| oct | 17 | 2.1% | |
| sep | 13 | 1.6% | |
| dec | 6 | 0.7% | |
| mar | 4 | 0.5% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
day_of_week
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| thu | |
|---|---|
| fri | |
| wed | |
| tue | |
| mon |
| Value | Count | Frequency (%) | |
| thu | 186 | 22.6% | |
| fri | 171 | 20.8% | |
| wed | 167 | 20.3% | |
| tue | 157 | 19.1% | |
| mon | 143 | 17.4% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
last_contact_duration
Real number (ℝ≥0)
| Distinct count | 500 |
|---|---|
| Unique (%) | 60.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1406.2087378640776 |
|---|---|
| Minimum | 1053 |
| Maximum | 4918 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.4 KiB |
Quantile statistics
| Minimum | 1053 |
|---|---|
| 5-th percentile | 1068.3 |
| Q1 | 1142.75 |
| median | 1271.5 |
| Q3 | 1500.5 |
| 95-th percentile | 2177.25 |
| Maximum | 4918 |
| Range | 3865 |
| Interquartile range (IQR) | 357.75 |
Descriptive statistics
| Standard deviation | 432.513432 |
|---|---|
| Coefficient of variation (CV) | 0.3075741321 |
| Kurtosis | 13.30726681 |
| Mean | 1406.208738 |
| Median Absolute Deviation (MAD) | 153.5 |
| Skewness | 3.065087859 |
| Sum | 1158716 |
| Variance | 187067.8689 |
| Value | Count | Frequency (%) | |
| 1106 | 6 | 0.7% | |
| 1156 | 5 | 0.6% | |
| 1120 | 5 | 0.6% | |
| 1130 | 5 | 0.6% | |
| 1063 | 5 | 0.6% | |
| 1206 | 5 | 0.6% | |
| 1080 | 5 | 0.6% | |
| 1161 | 5 | 0.6% | |
| 1081 | 4 | 0.5% | |
| 1210 | 4 | 0.5% | |
| Other values (490) | 775 | 94.1% |
| Value | Count | Frequency (%) | |
| 1053 | 1 | 0.1% | |
| 1054 | 1 | 0.1% | |
| 1055 | 2 | 0.2% | |
| 1056 | 3 | 0.4% | |
| 1057 | 2 | 0.2% |
| Value | Count | Frequency (%) | |
| 4918 | 1 | 0.1% | |
| 4199 | 1 | 0.1% | |
| 3785 | 1 | 0.1% | |
| 3643 | 1 | 0.1% | |
| 3631 | 1 | 0.1% |
campaign_contact_count
Real number (ℝ≥0)
| Distinct count | 18 |
|---|---|
| Unique (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.591019417475728 |
|---|---|
| Minimum | 1 |
| Maximum | 26 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 26 |
| Range | 25 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.296696752 |
|---|---|
| Coefficient of variation (CV) | 0.8864066153 |
| Kurtosis | 22.88968255 |
| Mean | 2.591019417 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.782941169 |
| Sum | 2135 |
| Variance | 5.27481597 |
| Value | Count | Frequency (%) | |
| 1 | 280 | 34.0% | |
| 2 | 251 | 30.5% | |
| 3 | 139 | 16.9% | |
| 4 | 67 | 8.1% | |
| 5 | 27 | 3.3% | |
| 6 | 19 | 2.3% | |
| 7 | 12 | 1.5% | |
| 10 | 6 | 0.7% | |
| 9 | 6 | 0.7% | |
| 8 | 4 | 0.5% | |
| Other values (8) | 13 | 1.6% |
| Value | Count | Frequency (%) | |
| 1 | 280 | 34.0% | |
| 2 | 251 | 30.5% | |
| 3 | 139 | 16.9% | |
| 4 | 67 | 8.1% | |
| 5 | 27 | 3.3% |
| Value | Count | Frequency (%) | |
| 26 | 1 | 0.1% | |
| 19 | 1 | 0.1% | |
| 17 | 2 | 0.2% | |
| 15 | 1 | 0.1% | |
| 14 | 1 | 0.1% |
poutcome
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 KiB |
| nonexistent | |
|---|---|
| failure | 62 |
| success | 26 |
| Value | Count | Frequency (%) | |
| nonexistent | 736 | 89.3% | |
| failure | 62 | 7.5% | |
| success | 26 | 3.2% |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.57281553 |
| Min length | 7 |
cons.price.idx
Real number (ℝ≥0)
| Distinct count | 25 |
|---|---|
| Unique (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.60324999999999 |
|---|---|
| Minimum | 92.20100000000001 |
| Maximum | 94.76700000000001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.4 KiB |
Quantile statistics
| Minimum | 92.201 |
|---|---|
| 5-th percentile | 92.843 |
| Q1 | 93.075 |
| median | 93.918 |
| Q3 | 93.994 |
| 95-th percentile | 94.465 |
| Maximum | 94.767 |
| Range | 2.566 |
| Interquartile range (IQR) | 0.919 |
Descriptive statistics
| Standard deviation | 0.5638976068 |
|---|---|
| Coefficient of variation (CV) | 0.006024337903 |
| Kurtosis | -0.7758244425 |
| Mean | 93.60325 |
| Median Absolute Deviation (MAD) | 0.474 |
| Skewness | -0.305689062 |
| Sum | 77129.078 |
| Variance | 0.3179805109 |
| Value | Count | Frequency (%) | |
| 93.918 | 181 | 22.0% | |
| 93.994 | 137 | 16.6% | |
| 92.893 | 105 | 12.7% | |
| 93.444 | 92 | 11.2% | |
| 94.465 | 87 | 10.6% | |
| 93.2 | 71 | 8.6% | |
| 93.075 | 57 | 6.9% | |
| 92.201 | 11 | 1.3% | |
| 92.431 | 11 | 1.3% | |
| 92.963 | 10 | 1.2% | |
| Other values (15) | 62 | 7.5% |
| Value | Count | Frequency (%) | |
| 92.201 | 11 | 1.3% | |
| 92.379 | 5 | 0.6% | |
| 92.431 | 11 | 1.3% | |
| 92.469 | 2 | 0.2% | |
| 92.649 | 6 | 0.7% |
| Value | Count | Frequency (%) | |
| 94.767 | 3 | 0.4% | |
| 94.601 | 2 | 0.2% | |
| 94.465 | 87 | 10.6% | |
| 94.215 | 3 | 0.4% | |
| 94.199 | 8 | 1.0% |
cons.conf.idx
Real number (ℝ)
| Distinct count | 25 |
|---|---|
| Unique (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -40.74526699029127 |
|---|---|
| Minimum | -50.8 |
| Maximum | -26.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.4 KiB |
Quantile statistics
| Minimum | -50.8 |
|---|---|
| 5-th percentile | -47.1 |
| Q1 | -42.7 |
| median | -42 |
| Q3 | -36.4 |
| 95-th percentile | -34.6 |
| Maximum | -26.9 |
| Range | 23.9 |
| Interquartile range (IQR) | 6.3 |
Descriptive statistics
| Standard deviation | 4.515199327 |
|---|---|
| Coefficient of variation (CV) | -0.1108153084 |
| Kurtosis | -0.01263063113 |
| Mean | -40.74526699 |
| Median Absolute Deviation (MAD) | 4.2 |
| Skewness | 0.4983707218 |
| Sum | -33574.1 |
| Variance | 20.38702496 |
| Value | Count | Frequency (%) | |
| -42.7 | 181 | 22.0% | |
| -36.4 | 137 | 16.6% | |
| -46.2 | 105 | 12.7% | |
| -36.1 | 92 | 11.2% | |
| -41.8 | 87 | 10.6% | |
| -42 | 71 | 8.6% | |
| -47.1 | 57 | 6.9% | |
| -26.9 | 11 | 1.3% | |
| -31.4 | 11 | 1.3% | |
| -40.8 | 10 | 1.2% | |
| Other values (15) | 62 | 7.5% |
| Value | Count | Frequency (%) | |
| -50.8 | 3 | 0.4% | |
| -50 | 3 | 0.4% | |
| -49.5 | 2 | 0.2% | |
| -47.1 | 57 | 6.9% | |
| -46.2 | 105 | 12.7% |
| Value | Count | Frequency (%) | |
| -26.9 | 11 | 1.3% | |
| -29.8 | 5 | 0.6% | |
| -30.1 | 6 | 0.7% | |
| -31.4 | 11 | 1.3% | |
| -33 | 6 | 0.7% |
nr.employed
Real number (ℝ≥0)
| Distinct count | 10 |
|---|---|
| Unique (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5171.220388349514 |
|---|---|
| Minimum | 4963.6 |
| Maximum | 5228.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.4 KiB |
Quantile statistics
| Minimum | 4963.6 |
|---|---|
| 5-th percentile | 5017.5 |
| Q1 | 5099.1 |
| median | 5195.8 |
| Q3 | 5228.1 |
| 95-th percentile | 5228.1 |
| Maximum | 5228.1 |
| Range | 264.5 |
| Interquartile range (IQR) | 129 |
Descriptive statistics
| Standard deviation | 71.69256184 |
|---|---|
| Coefficient of variation (CV) | 0.01386376067 |
| Kurtosis | 0.2462926255 |
| Mean | 5171.220388 |
| Median Absolute Deviation (MAD) | 32.3 |
| Skewness | -1.152195025 |
| Sum | 4261085.6 |
| Variance | 5139.823423 |
| Value | Count | Frequency (%) | |
| 5228.1 | 360 | 43.7% | |
| 5099.1 | 165 | 20.0% | |
| 5191 | 137 | 16.6% | |
| 5195.8 | 75 | 9.1% | |
| 5076.2 | 23 | 2.8% | |
| 5017.5 | 22 | 2.7% | |
| 4991.6 | 14 | 1.7% | |
| 4963.6 | 13 | 1.6% | |
| 5008.7 | 9 | 1.1% | |
| 5023.5 | 6 | 0.7% |
| Value | Count | Frequency (%) | |
| 4963.6 | 13 | 1.6% | |
| 4991.6 | 14 | 1.7% | |
| 5008.7 | 9 | 1.1% | |
| 5017.5 | 22 | 2.7% | |
| 5023.5 | 6 | 0.7% |
| Value | Count | Frequency (%) | |
| 5228.1 | 360 | 43.7% | |
| 5195.8 | 75 | 9.1% | |
| 5191 | 137 | 16.6% | |
| 5099.1 | 165 | 20.0% | |
| 5076.2 | 23 | 2.8% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | age | job | marital | education | default | housing | loan | comm_type | month | day_of_week | last_contact_duration | campaign_contact_count | poutcome | cons.price.idx | cons.conf.idx | nr.employed | cluster | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 37 | 52 | technician | married | basic.9y | no | yes | no | telephone | may | mon | 1666 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 1 | 75 | 41 | blue-collar | divorced | basic.4y | unknown | yes | no | telephone | may | mon | 1575 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 2 | 88 | 49 | technician | married | basic.9y | no | no | no | telephone | may | mon | 1467 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 3 | 164 | 39 | services | divorced | high.school | unknown | no | no | telephone | may | mon | 2033 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 4 | 199 | 43 | blue-collar | married | basic.6y | no | yes | no | telephone | may | mon | 1077 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 5 | 388 | 28 | unknown | single | unknown | unknown | yes | yes | telephone | may | tue | 1201 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 6 | 446 | 42 | technician | married | professional.course | no | no | no | telephone | may | tue | 1623 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 7 | 469 | 42 | management | married | university.degree | no | no | no | telephone | may | tue | 1677 | 1 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 8 | 556 | 42 | blue-collar | married | high.school | no | no | yes | telephone | may | tue | 1297 | 3 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
| 9 | 590 | 32 | technician | married | professional.course | no | no | no | telephone | may | tue | 1906 | 3 | nonexistent | 93.994 | -36.4 | 5191.0 | 1 |
Last rows
| df_index | age | job | marital | education | default | housing | loan | comm_type | month | day_of_week | last_contact_duration | campaign_contact_count | poutcome | cons.price.idx | cons.conf.idx | nr.employed | cluster | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 814 | 40672 | 45 | unemployed | married | professional.course | unknown | no | no | telephone | sep | thu | 1405 | 1 | failure | 94.199 | -37.5 | 4963.6 | 1 |
| 815 | 40730 | 60 | retired | married | high.school | no | no | no | cellular | sep | wed | 1640 | 1 | nonexistent | 94.199 | -37.5 | 4963.6 | 1 |
| 816 | 40764 | 36 | technician | married | university.degree | no | unknown | unknown | cellular | sep | thu | 1334 | 2 | nonexistent | 94.199 | -37.5 | 4963.6 | 1 |
| 817 | 40836 | 30 | student | single | professional.course | no | yes | no | cellular | sep | mon | 1616 | 4 | success | 94.199 | -37.5 | 4963.6 | 1 |
| 818 | 40838 | 32 | admin. | married | high.school | no | yes | no | cellular | sep | mon | 1298 | 1 | nonexistent | 94.199 | -37.5 | 4963.6 | 1 |
| 819 | 40880 | 28 | admin. | single | high.school | no | no | no | cellular | oct | wed | 1246 | 2 | nonexistent | 94.601 | -49.5 | 4963.6 | 1 |
| 820 | 40970 | 24 | admin. | single | university.degree | no | yes | no | cellular | oct | fri | 1176 | 3 | success | 94.601 | -49.5 | 4963.6 | 1 |
| 821 | 41121 | 46 | admin. | single | university.degree | no | yes | no | cellular | nov | tue | 1166 | 3 | failure | 94.767 | -50.8 | 4963.6 | 1 |
| 822 | 41123 | 36 | blue-collar | single | basic.6y | no | no | no | cellular | nov | tue | 1556 | 4 | nonexistent | 94.767 | -50.8 | 4963.6 | 1 |
| 823 | 41164 | 54 | admin. | married | professional.course | no | no | no | cellular | nov | tue | 1868 | 2 | success | 94.767 | -50.8 | 4963.6 | 1 |